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We prove a Fife-like characterization of the infinite binary ^ -power- free words, by giving a finite 
automaton of 15 states that encodes all such words. As a consequence, we characterize all such 
words that are 2-automatic. 

1 Introduction 

An overlap is a word of the form axaxa, where a is a single letter and x is a (possibly empty) word. In 
1980, Earl Fife O proved a theorem characterizing the infinite binary overlap-free words as encodings of 
paths in a finite automaton. Berstel [T] later simplified the exposition, and both Carpi ||6l and Cassaigne 
||71 gave an analogous analysis for the case of finite words. 

In a previous paper [13], the second author gave a new approach to Fife's theorem, based on the 
factorization theorem of Restivo and Salemi |[T2l for overlap-free words. In this paper, we extend this 
analysis by applying it to the case of ^-power-free words. 

Given a rational number ^ > 1, we define a word w to be a ^-power if w can be written in the form 
x"x' where n = \_plq\, x' is a (possibly empty) prefix of x, and |w|/|x| = p/q. The word x is called a 
period of w, and is an exponent of w. li p/q is the largest exponent of w, we write exp(w) = p/q. 
We also say that w is \x\-periodic. For example, the word alfalfa is a |-power, and the corresponding 
period is alf . Sometimes, as is routine in the literature, we also refer to [a'| as the period; the context 
should make it clear which is meant. 

A word, whether finite or infinite, is ^-power-free if it contains no factor w that is an a-power for 
a > j8. A word is -power-free if it contains no factor w that is an a-power for a > j8. Thus, the 
concepts of "overlap-free" and "2+ -power-free" coincide. 

2 Notation and basic results 

Let £ be a finite alphabet. We let £* denote the set of all finite words over £ and E'*^ denote the set of all 
(right-) infinite words over £. We say y is n factor of a word w if there exist words x, z such that w = xyz. 
If X is a finite word, then x^ represents the infinite word xxx • • • . 

P. Ambroz, S. Holub and Z. Masakova (Eds.): 

8th International Conference WORDS 201 1 

EPTCS 63, 2011, pp. 1894T98I doi: 1 0.4204/EPTCS.63.25I 



190 



Fife's Theorem fori /3-Powers 



From now on we fix E = {0, 1}. The most famous infinite binary overlap-free word is t, the Thue- 
Morse word, defined as the fixed point, starting with 0, of the Thue-Morse morphism /i, which maps 
to 01 and 1 to 10. We have 

t = totit2 • • • = 01 101001 100101 10 • • • . 

The morphism /i has a second fixed point, t, which is obtained from t by applying the complementation 
coding defined by = 1 and 1=0. 

We let =^7/3 denote the set of (right-) infinite binary | -power-free words. We point out that these 
words are of particular interest, because | is the largest exponent a such that there are only polynomially- 
many a-power-free words of length n\9\. The exponent | plays a special role in combinatorics on words, 
as testified to by the many papers mentioning this exponent (e.g., |[T0l[T4l l9l[Tn[ni5l'). 

We now state a factorization theorem for infinite ^-power-free words: 

Theorem 1. Let x € =^7/3, and let P = {po,pi,p2,P3,P4.}, where po = e, p\ = 0, p2 = 00, pj, = 1, and 
P4 = 11. Then there exists y E ^7/3 and p £ P such that x = /J/l(y). Furthermore, this factorization is 
unique, and p is uniquely determined by inspecting the first 5 letters ofx. 

Proof. The first two claims follow immediately from the version for finite words, as given in ||9]- The 
last claim follows from exhaustive enumeration of cases. □ 

We can now iterate this factorization theorem to get 
Corollary 2. Every infinite Ypower-free word x can be written uniquely in the form 

^ = Pi,KPi2KPhl^{---))) (1) 

with ij G {0, 1,2,3,4} for j > I, subject to the understanding that if there exists c such that ij = Q for 
j > c, then we also need to specify whether the "tail" of the expansion represents /l'*'(0) = t or /x'*'(l) = t. 
Furthermore, every truncated expansion 

Ph^ {Pi2 M (Ph ^{■■■Pi„-,^{Pi.,) ■■■))) 

is a prefix of x, with the understanding that if in = 0, then we need to replace pi^ with either \ (if the 
"tail" represents i) or 3 (if the "tail" represents i). 

Proof. The form ([T} is unique, since each pi is uniquely determined by the first 5 characters of the 
associated word. □ 

Thus, we can associate each infinite binary ^-power-free word x with the essentially unique infinite 
sequence of indices i := {ij)j>o coding elements in P, as specified by ([T]). If i ends in 0®, then we need 
an additional element (either 1 or 3) to disambiguate between t and t as the "tail". In our notation, we 
separate this additional element with a semicolon so that, for example, the string 000 • • • ; 1 represents t 
and 000 • • ■ ; 3 represents t. 

Of course, not every possible sequence of {ij)j>\ of indices corresponds to an infinite |-power- 
free word. For example, every infinite word coded by an infinite sequence beginning 400 • • • has a 
^ -power. Our goal is to characterize precisely, using a finite automaton, those infinite sequences corre- 
sponding to ^-power-free words. 

Next, we recall some connections between the morphism jj. and the powers over the binary alphabet. 
Below X is an arbitrary finite or right-infinite word. 

Lemma 3. If the word /i(x) has a prefix zz, then the word x has the prefix /i^^(z)/i^^ (z). 
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Proof. Follows immediately from ||3] Lemma 1.7.2]. 



□ 



Lemma 4. (1) For any real ^ > I, we have exp(jc) = j3 iffexp{iJ.{x)) = j3. 
(2) For any real (5 > 2+, the wordx is ^-power-free iff }x{x) is ^-power-free. 



Proof For (1), see HU Prop. 1.1]. For (2), see lE] Prop. 1.2] or [9. Thm. 5]. 



□ 



Lemma 5. Let p be a positive integer If the longest p-periodic prefix of the word has the exponent 
j8 > 2, then the longest {p/ 2) -periodic prefix ofx also has the exponent j8. 

Proof. Let zzz' (where |z| = p and z' is a possibly empty prefix of z®) be the longest /^-periodic prefix of 
H{x). Lemma [3] implies that p is even. If \z'\ is odd, let a be the last letter of z'. The next letter b in /x(x) 
is fixed by the definition of jj.: b ^ a. By the definition of period, another a occurs p symbols to the left 
of the last letter of z'. Since p is even, this a also fixes the next letter b. Hence the prefix zzz'b of iJ.{x) 
is p-periodic, contradicting the definition of zzz'. Thus \z'\ is even. Therefore x begins with the j3 -power 
of period p/2. 

It remains to note that if x has a (/7/2) -periodic prefix y of exponent a > j8, then by Lemma|4](l), the 
p-periodic prefix iJ.{y) of /i(jc) also has the exponent a, contradicting the hypotheses of the lemma. □ 

3 The main result 

For each finite word w G {0, 1,2,3,4}*, w = i\i2 ■ --ir, and an infinite word x G {0, 1}'*', we define 



Lemma 6. The sets F^ satisfy the equalities listed in Fig. [7] In particular, there are only 15 different 
nonempty sets F„; they are 



Proof. Due to symmetry, it is enough to prove only the 30 equalities from the upper half of Fig. [T] and 
the equality Fq = Fg. We first prove the emptiness of 15 sets from the upper half of Fig. [T] 

Four sets: F21, F22, F201, and F202, consist of words that start 000. 

Eight sets consist of words that contain the factor 0/^(11) = 01010 (F14, F24, F133, F134, F1303, and 
^1304)> its /I -image (F114), or the complement of its /i -image (F132). 

Two sets: F2031 and F2032, consist of words that start 00/^^(1)0 = 0010010. Finally, the words from 
the set F200 have the form 00/i^(x); each of these words starts either 000 or 0010010. 

Each of the 16 remaining equalities has the form F„^ = F„^. We prove them by showing that for 
an arbitrary x G ^7/3, the words u\ = Ch, (x) and U2 = Cw2{^) are either both ^-power-free or both 
not. In most cases, some suffix of mi coincides with the image of U2 under some power of }JL. Then by 
Lemma |4](2) the word u\ can be ^-power-free only if ui is ^-power-free. In these cases, it suffices to 
study u\ assuming that U2 is ^-power-free. 

When we refer to a "forbidden" power in what follows, we mean a power of exponent > |. 

Fo = Fg: By Lemma|4](2), u\ = /x(x) is |-power-free iff M2 = x is ^-power-free. 



Note that 




Fe,F\,F\\,F\i,,F\T,o,F2, F20 , F203 , F3 , F3 1 , F3 10 , F33 , F4 , F40 , F40 1 . 
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Figure 1: Equations between languages F„ 
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^10 = ^e- The word ui = 0/i(/i(x)) contains a /i^-image of U2 = x. If x is |-power-free, then so is 
/x^(x). Hence, if ui has a forbidden power, then this power must be a prefix of mi . 

Now let j8 < 7/3 be the largest possible exponent of a prefix of x and q be the smallest period of a 
prefix of exponent j8 in x. Write ^ = p/q. Then the word /x^(x) has a prefix of exponent j8 and of period 
Aq by Lemma|4](l), but no prefixes of a bigger exponent or of the same exponent and a smaller period by 
Lemma [5] Hence u\ has no prefixes of exponent greater than (4/7 + 1)/ (4^). Since p and q are integers, 
we obtain the required inequality: 

P ^ ,3^ 4/7 + 17 

^ < - ^ 3p < 7^ ^ 3/7 + - < 7^ ^ -L- < - . 

q 3 4 Aq 3 

Fi2 = F2: The word mi = 0/x(00/i(x)) contains a /x-image of = 00/x(x). Suppose that U2 is 
power-free. Then it starts 001001 1. Since the factor 001001 cannot occur in a jU -image, we note that 

(*) the word 00/i(x) has only two prefixes of exponent 2 (00 and 001001) and no prefixes of bigger 
exponents. 

By Lemma[5l the word ^11(^2) has only two prefixes of exponent 2 (;U(00) and /x(OOlOOl)) and no 
prefixes of bigger exponents. Thus, the word ui = 0/x(m2) is obviously |-power-free. 

F23 = ^13: We have ui = 00/i(l/x(x)) = 0m2- Suppose the word U2 is ^-power-free; then it starts 
010011. A forbidden power in u\, if any, occurs at the beginning and hence contains 0010011. But 
00100 does not occur later in this word, so no such forbidden power exists. 

^110 = ^3: The word ui = 0/i(0/x(/x(x))) = 001/i^(x) is a suffix of the /i^-image of U2 = lAt(x). 
Hence, if U2 is | -power-free, then by Lemma 51(2) mi is ^-power-free as well. 

For the other direction, assume U[ is |-power-free and then /i(x) is ^-power-free. So, if U2 contains 
some power yyy' with \y'\ > \y\/3, then this power must be a prefix of U2- Put y = Iz and y' = Iz'. The 
word jU(x) starts zlzlz'. Hence x starts /i^' (zl)/i^^ (zl) by LemmalU So we conclude that |zl[ = \y\ is 
an even number. Now let \y\ = q and p = \yyy'\ so that p/q > 7/3. Thus the word lui = /i^(m2) starts 
with a (/7/^)-power of period Aq. Since ui is ^-power-free, we have {Ap—l)/Aq < 7/3. 

This gives us the inequalities 3p > Iq and 3/7 — 3/4 < Iq. Since p and q are integers this means 
3p = 7q and hence q is divisible by 3. On the other hand from above q is even. So q is divisible by 6. 
Now \y'\ = \y\/3 so \y'\ is even. But then z' is odd, and begins at an even position in a /i -image, so the 
character following z' is fixed and must be the same character as in the corresponding position of z, say 
a. Thus zlzlz'a is a (7/3)-power occurring in /i(x), a contradiction. 

^111 =^11: The word Ml =0/x(0/l(0/x(x))) =00101 10/^^ (x) contains a /i -image of M2 = 0/x(0/i(x)) = 
001/x^(x). Suppose U2 is |-power-free but to the contrary ui = 0/i(m2) has a forbidden power. By 
Lemma |4](2), this power must be a prefix of ui. Note that this power can be extended to the left by 1 
(not by 0, because a jLt -image cannot contain 000). Hence the word lui = /x^(lx) starts with a forbid- 
den power. This induces a forbidden power at the beginning of Ix; this power has a period q and some 
exponent p/q > 7/3. Then ui has a prefix of exponent (8/7 — l)/Sq > 7/3. On the other hand the word 
IjL{u2) is ^-power-free, whence (8/7 — 2)/q< 7/3. So we get the system of inequalities 3/7 — 3/8 > Iq, 
3p — 3 /A <lq. This system has no integer solutions, a contradiction. 

Fn2 = F2. We have m = 0li{0li{QOpi{x))) = = 00\ii^{u2). In view of {-k), one can 

easily check that if U2 is ^-power-free, then so is mi. 

F113 = We have ui = 0/x(0/i(l/^(x))) = 0/x(m2). Suppose U2 is ^-power-free. Then /i^(x) 
starts 01101001. Assume to the contrary that u\ has a forbidden power. By Lemma |4](2), this power 
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must be a prefix of mi. Again, this power can be extended to the left by 1, not by 0. Hence the word 
1mi = /x^(ll/x(x)) starts with a forbidden power, thus inducing a forbidden power at the beginning of 
M = 1 l/i(x) = 110110- • • . The word u has only two squares as prefixes (11 and 1 10110, cf. (★)). Hence u 
has the prefix 1 101 1010 and no forbidden factors except for the (7/3)-power prefix 1 101 101. Therefore, 
the word ui has no prefixes of exponent > 7/3. 

^131 = ^3i: We have ui = 0/x(l/i(0/x(x))) = 0/x(m2)- Suppose is ^-power-free. Then the word 
u = ll/x(0/i(x)) is ^-power-free by the equality F41 = F31, which is symmetric to F23 = F13 proved 
above. But ui is a suffix of whence the result. 

^204 = F4: We have ui = 00/x(/x(ll/i(x))) = 00/x^(m2). Suppose U2 is ^-power-free. Using the 
observation symmetric to (★), we check by inspection that ui contains no forbidden power. 

■^1300 =Fuo- Neither one of the words ui = 0;U(l/i(/i(/i(x)))) = 010/x'^(x), U2 = 0/x(l/x(;U(x))) = 
010/x^(x) contains an image of the other. The proofs for both directions are essentially the same, so we 
give only one of them. Let ui be ^-power-free; then the words /x'*(x), x, and /^^(x) are ^-power-free as 
well, and x starts 0. A simple inspection of short prefixes of U2 shows that if this word is not |-power- 
free, then some jS -power with j3 > 2 is a prefix of /i^(x). By Lemma [51 the word x also starts with a 
j8 -power. The argument below will be repeated, with small variations, for several identities. 

(*) Consider a prefix yyy' of x which is the longest prefix of x with period \y\. Then \y'\ < \y\/3. 
By Lemma[5l the longest prefix of the word /^^(x) having period S\y\ is j^^iyyy'). If some word 
zlJ.^{yyy') also has period S\y\, then z should be a suffix of a /^^ -image of some word. Since 
the word 010 is not such a suffix, then lOiJ.^{yyy') is the longest possible (8|y|)-periodic word 
contained in U2- Let us estimate its exponent. Since |z| and \y\ are integers, we have 

8iy 1 < S\y\/3 =^ 24\y\ < 81^1 ^ 241/1 + 6 < S^l =^ 81/1+2 < 8^1/3, 

whence exp {10 jj.^ {yyy')) < 7/3. Since we have chosen an arbitrary prefix yyy' of x, we conclude 
that the word U2 is ^-power-free. 

^1301 =^1: The word Ml = 0;U(l/x(;U(0;U(x)))) = 010/i^(0/x(x)) contains a /i^-image of M2 = 0/x(x). 
Suppose U2 is ^-power-free. It suffices to check that the prefix 010 of ui does not complete any prefix of 
jU^(m2) to a forbidden power. For short prefixes, this can be checked directly, while long prefixes that can 
be completed in this way should have exponents > 2. By Lemma [5j a prefix of period p and exponent 
j8 > 2 of the word jU^(m2) corresponds to the prefix of the word U2 having the exponent jS and the period 
p/S. So, we repeat the argument (*) replacing x with U2 to obtain that ui is |-power-free. 

^1302 = F2: We have ui = Oh{\Ijl{h{OOix{x)))) = 010^4^(00^4 (x)) = 010a4^(m2)- Suppose U2 is ^- 
power-free. By (★) and Lemma[5l among the prefixes of /x^(m2) there are only two squares, /i^(OO) and 
jU^(OOlOOl), and no words of bigger exponent. By direct inspection, ui is (7/3)-free. 

^2030 = ^3io: Neither one of the words ui = 00/i(/x(l;U(;U(x)))) = 001001/x'*(x) and 

U2 = 1A4(OA4(A4(x))) = 10lA4^(x) 

contains an image of the other. If the word ui is assumed to be | -power-free, then the proof repeats 
the proof of the identity F1300 = Fi3o> up to renaming all O's to I's and vice versa. Let U2 be |-power- 
free. The words /x^(x) and x are also ^-power-free, and x begins with 1, assuring that there are no short 
forbidden powers in the beginning of ui. Concerning long forbidden powers, we consider, similar to (*), 
a prefix yyy' of x which is the longest prefix of x with period \y\. The longest possible (16|j|)-periodic 
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word contained in ui is OlOOljj.'^ {yyy'), because 001001 is not a suffix of a /I'^-image. As in (*), we 
obtain 16|y| +5 < 16|>'|/3, implying exp(01001/x^(jjy)) < 7/3. Hence the word ui is ^-power-free. 

F2033 = F33: The word ui = 00/i(/i(ljU(l/i(;c)))) = 00/^^(1 10/i^(x)) contains a /x^-image of U2 = 
l/x(l/x(;c)) = 1 10n^{x). Again, if the word is ^-power-free, then so is /i^(m2), and it suffices to check 
that the prefix 00 of ui does not complete any prefix of /i^(m2) to a forbidden power. Similar to (*), 
consider a prefix yyy' of U2 which is the longest prefix of U2 with period \y\. The longest possible (413^1)- 
periodic word contained in ui is OiJ.^{yyy'), because 00 is not a suffix of a /i^-image. As in (*), we see 
that 4\y'\ + 1 < 4|j|/3, implying exp(0/i^(yyy')) < 7/3, and conclude that the word ui is ^-power-free. 

^2034=^4: The word Ml = 00/x(/x(l/i(ll/x(x)))) = 001001/x^(ll/x(x)) contains a /i'' -image of M2 = 
ll/i(x). Suppose U2 is | -power-free. Using (*) and Lemma |5l we conclude that among the prefixes of 
jU^(m2) there are only two squares, /i^(ll) and /^^(IIOIIO), and no words of bigger exponent. By direct 
inspection, ui is (7/3) -free. □ 
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Figure 2: Automaton coding infinite binary |-power-free words 

From Lemma |6] and the results above, we get 

Theorem 7. Every infinite binary ^-power-fi-ee word x is encoded by an infinite path, starting in F^, 
through the automaton in Figure^ 

Every infinite path through the automaton not ending in 0"' codes a unique infinite binary ^-power- 
free word X. If a path i ends in 0'*' and this suffix corresponds to a cycle on state Fg then x is coded by 
either i; 1 or i;3. If a path i ends in O'*' and this suffix corresponds to a cycle on F310, then x is coded by 
i; 3. If a path i ends in 0® and this suffix corresponds to a cycle on F\^q, then x is coded by i; 1. 

Remark 8. Blondel, Cassaigne, and Jungers Q obtained a similar result, and even more general ones, 
for finite words. The main advantage to our construction is its simplicity. 
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Corollary 9. Each of the 15 sets F^, Fy, F2, F^, F4, Fn, F33, F13, F31, F20, F^q, Fi^q, F310, F203, -f^oi is 
uncountable. 

Proof. It suffices to provide uncountably many distinct patlis from each state to itself. By symmetry, it 
suffices to prove this for all the states labeled e or below in Figure |2] These are as follows: 

• e: (0+10)'" 

• 1: (01+001)'^ 

• 2: (0402 + 030402)'" 

• 11: (0011+00011)'" 

• 13: (013 + 0013)® 

• 20: (4020 + 34020)'" 

• 401: (10401+203401)'" 

• 130: (0+104010)'". 

□ 

Corollary 10. For all words w G {0, 1 , 2, 3,4}*, either F^ is empty or uncountable. 

4 The lexicographically least | -power-free word 

Theorem 11. The lexicographically least infinite binary ^-power-free word is OOlOOlt. 

Proof. By tracing through the possible paths through the automaton we easily find that 2030*"; 1 is the 
code for the lexicographically least sequence. □ 

Remark 12. This result does not seem to follow directly from as one referee suggested. 

5 Automatic infinite binary ^-power-free words 

As a consequence of Theorem |7J we can give a complete description of the infinite binary |-power- 
free words that are 2-automatic [3 |. Recall that an infinite word (a„)„>o is ^-automatic if there exists 
a deterministic finite automaton with output that, on input n expressed in base k, produces an output 
associated with the state last visited that is equal to a„. Alternatively, (a„)„>o is /c-automatic if its k- 
kemel 

{{ak-n+j)n>o ■ / >OandO< j</c'} 
consists of finitely many distinct sequences. 

Theorem 13. An infinite binary ^-power-free word is 2-automatic if and only if its code is both specified 
by the DFA given above in Figure^ and is ultimately periodic. 

First, we need a lemma: 

Lemma 14. An infinite binary word x = aQa\a2 ■■■ is 2-automatic if and only if }x{x) is 2-automatic. 
Proof. Proved in [13 1. □ 
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Now we can prove Theorem [13] 

Proof. Suppose the code of x is ultimately periodic. Then we can write its code as jz® for some finite 
words y and z- Since the class of 2-automatic sequences is closed under appending a finite prefix |[3] 
Corollary 6.8.5], by Lemma [141 it suffices to show that the word coded by is 2-automatic. 

The word z'" codes a | -power-free word w satisfying w = t(p{w), where f is a finite word and (p = /i*^. 
Hence, by iteration, we get that w = t(p{t)(p^{t) ■ ■ ■ . It is now easy to see that the 2-kemel of w is 
contained in 

S := {un'{v)n'^\v)n'^^''{v) ■■■ : \u\ < \t\ and v G {t,t} and 1 < / < k}, 
which is a finite set. 

On the other hand, suppose the code for x is not ultimately periodic. Then we show that the 2-kemel 
is infinite. Now it is easy to see that if the code for x is ay for some letter a € {0, 1,3} then one of the 
sequences in the 2-kernel (obtained by taking either the odd- or even-indexed terms) is either coded by y 
or its complement is coded by y. On the other hand, if the code for x is ay with a G {2,4}, then y begins 
with 0, 1, or 3, say y = bz. It follows that the subsequences obtained by taking the terms congruent to 
0, 1,2, or 3 (mod 4) is coded by z, or its complement is coded by z. Since the code for x is not ultimately 
periodic, there are infinitely many distinct sequences in the orbit of the code for x, under the shift. By the 
infinite pigeonhole principle, infinitely many correspond to a sequence in the 2-kemel, or its complement. 
Hence x is not 2-automatic. □ 
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